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We use techniques from the statistical mechanics of disordered systems to analyse the properties 
of Nash equilibria of bimatrix games with large random payoff matrices. By means of an annealed 
bound, we calculate their number and analyse the properties of typical Nash equilibria, which are 
exponentially dominant in number. We find that a randomly chosen equilibrium realizes almost 
always equal payoffs to either player. This value and the fraction of strategies played at an equi- 
librium point are calculated as a function of the correlation between the two payoff matrices. The 
picture is complemented by the calculation of the properties of Nash equilibria in pure strategies. 
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Game theory seeks to model problems of strategic decision-making arising in economics, sociology, or international 
relations. In the generic set-up, a number of players choose between different strategies, the combination of which 
determines the outcome of the game specified by the payoff to each player. Contrary to the situation in ordinary 
optimization problems, these payoffs are in general different for different players leading to a competitive situation 
where each player tries to maximize his individual payoff. One of the cornerstones of modern economics and game 
theory is therefore the concept of a Nash equilibrium || , see also , which describes a situation where no player can 
unilaterally improve his payoff by changing his individual strategy given the other players all stick to their strategies. 
However this concept is thought to suffer from the serious drawback that in a typical game-theoretical situation there 
is a large number of Nash equilibria with different characteristics but no means of telling which one will be chosen by 
the players, as would be required of a predictive theory. 

This conceptual problem already shows up in the paradigmatic model of a bimatrix game between two players 
X and Y where player X chooses strategy i £ (1 . . . N) with probability Xi > and player Y chooses strategy 
j G (1 . . . N) with probability yj > 0. The vectors x = {x\, ...,xn), y = (yi, ...,2/jv) are called mixed strategies and 
are constrained to the (N — l)-dimensional simplex by normalization. For a pair of pure strategies (i, j) the payoff to 
player X is given by the corresponding entry in his payoff matrix ay whereas the payoff to player Y is given by bij . 
The expected payoff to player X is thus given by ^(x, y) = ■ Xia^yj and analogously for player Y. Every player 
has the intention to maximize his own payoff. A Nash equilibrium (NE) (x, y) is defined by 

u x (x,y) = max x / ^(x',y) 

^( x ,y) = max y / i/j,(x,y') (1) 

since in this situation there exists no mixed strategy x which will increase the expected payoff to X given Y does not 
alter his strategy and vice versa for Y. Thus no player has an incentive to unilaterally change his strategy. 

Apart from applications in economics, politics, sociology, and mathematical biology, there exists a wide body of 
mathematical literature on bimatrix games concerned with fundamental topics such as exact bounds for e.g. the 
number of NE and efficient algorithms for locating them || . For games even of moderate size a large number of 
NE are found, forming a set of disconnected points. In general the different NE all correspond to different payoffs to 
the players. 

However many situations of interest are characterized by a large number of possible strategies and complicated 
relations between the strategic choices of the players and the resulting payoffs. In such cases it is tempting to model 
the payoffs by random matrices in order to calculate typical properties of the game. This idea is frequently used in the 
statistical mechanics approach to complex systems such as spin glasses [0^] , neural networks , evolutionary models 
p0| , or hard optimization problems Recently this approach has been used to investigate the typical properties 
of so-called zero-sum games [[l2| . 

In this Letter we investigate the properties of Nash equilibria in large bimatrix games with random payoffs, i.e. 
with random entries of the payoff matrices. Using techniques from the statistical mechanics of disordered systems 
we estimate the number of NE with a given payoff. We find that the NE are exponentially dominated in number by 
equilibria with a certain payoff. 



* Unite Mixte de Recherche du Centre National de la Recherche Scientifique et de l'Ecole Normale Superieure 



1 



In this approach characteristics of the game are encoded in the distribution of payoff matrices - with only a few 
parameters - instead of the payoff matrices themselves. Clearly the choice of the probability distribution to be 
averaged over can influence the results. However a number of simplifying observations may be made: the two payoff 
matrices may be multiplied by any constant or have any constant added to them without changing the properties of 
the game in any material way. Thus there is no loss of generality involved in considering payoffs of order TV -1 / 2 and 
of zero mean. We assume that the entries of the payoff matrices at different sites are identically and independently 
distributed. In the thermodynamic limit one finds that only the first two moments of the payoff distribution are 
relevant so the entries of the payoff matrices may be considered to be Gaussian distributed. The only property of the 
distribution of payoffs which is not fixed by these specifications is the correlation k between entries of the same site 
of the two payoff matrices. We thus choose the entries of the payoff matrices to be drawn randomly according to the 
probability distribution 

Pdaij}, fa}) = II ^JfT^ ex P ■ ( 2 ) 



i.e. a Gaussian distribution with zero mean, variance l/N and correlation ay&fc/ = n5ik5ji/N for all pairs and 
(k, I). Here and in the following, the overbar ~ denotes the average over the payoff distribution 

Thus for k = — 1 one finds a Dirac-delta factor S(aij + bij) in (||) corresponding to a zero-sum game. K = 
corresponds to uncorrelated payoff matrices and k = 1 gives the so-called symmetric case = bij where the two 
players always receive identical payoffs. The parameter k consequently describes the degree of similarity between the 
payoffs to either player and may be used to continuously tune the game from a zero-sum game to a purely symmetric 
one. The former admits no cooperation at all between the players. Negative k correspond to an on average competitive 
situation, whereas for increasing k there are more and more pairs of strategies which are beneficial to both players. 

As a starting point for the statistical mechanics calculations, we remark that ([!]) is equivalent to the set of inequalities 

^2 a ijVj - v x (x, y) < Vi 

3 

^2 x fa - ^v( x ' y) < o Yj . (3) 

i 

Due to the non-negativity of the probabilities Xi and yj this directly results in the local equations specifying a NE 
with payoffs v x and v y 



Xi ", ,//, - /', j = ( V , aijUj /', ) • Vi 

(^2 ■''•'>■ j - v v^ 



|^Z^ x i b ij - vyj = (Ei x ihj - vy) < Vj . (4) 

We introduce real- valued variables x and y with Xi — Xi<d(xi) (where Q(x) is the Heaviside step- function) and likewise 
for yj . Then the last conditions (Q) may be written as 

!f ( x ,y) = XiQ(-Xi) - (E,- -v x )=0 



Ij(x,f) = Vi®{-Vj) - (E,- xSixijbij - v v ) = , (5) 



J 3 

i.e. we have constructed indicator functions If and If which are zero at a NE (x, y) and non-zero elsewhere [[l3] 
The number j\f(v Xl v y ) of NE with payoffs v x and v y may thus be calculated from 



a(i x ,F) 



9(x,y) 



, (6) 



where the mixed strategies have been rescaled to Ei x i = Ej Vj = N for convenience. Although we expect an 
exponential number of NE, so lnA^ is an extensive quantity, we calculate the so-called annealed average In AT instead 
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of the quenched one In J\f . This gives an exact upper bound to the typical number of NE at given payoffs FL Using 
integral representations of the delta functions in (^), averaging over the disorder (||), and introducing the order 
parameters q x — N^ 1 xfQ(xi), R x = N^ 1 JT iXiXi&(xi), and similarly for player Y we obtain 

f dn x ' y dn x ' y P dE x ' y [ dR x ' y 

Ar< "- = / tMf / pf / exp{iiV(,,r + q " 4 " - KR ' R " + E " + E " )} 



-i^2xiXiQ(-Xi) - iv x - i^ a, ^i i 0(a;i) + (a; «-> y with -> # x ) \ \\ det{B)\\ 

i i i J 



(7) 



with H(x) = f°°dx' exp(— x 12 /2)/^/2tt. Here we have used the assumption that the normalizing determinant 
|| det(-B)|| = || ^(x'^-)" 1 II i s self- averaging and is effectively uncorrelated with the rest of expression (0) and may therefore 
be averaged over the disorder independently of the rest of the expression G| 0. We obtain 



det(B)|| = expj ^(p x lnp x - p x + p y lnp y - p y )\ , 



(8) 



where the fraction of strategies played with non-zero probability p x — N^ 1 ^T Q(xi) is determined self-consistently 
by introducing p x , p y and their conjugates p x , p y in (0). After the transformation iq x,y — > q X:V and similarly for 
E x ' y ,p x ' y , R y , the integrals over x, x,y,y may be performed. In the limit of large payoff matrices N — > oo, the 
integrals over the order parameters are dominated by their saddle point. Furthermore (|7|) is now symmetric under an 
interchange of the players, and the maximum (and hence exponentially in N dominating) number of NE is found at 
equal payoffs to each player. Thus it makes sense to restrict the analysis to the case v x = v y = v. We obtain 

qq kR 2 p 
~2 2 2 

f iP- (K - KFt,s/n"\ 2 } ( F! — ie.Ru/n \\ 

(9) 



S K (v) = — lnW(V, v) = 2 extr q ^R t E tP 



m I h i — '— i + w— ex P (_£ + t:f:'f ] h ( E - ^ 



yqq + K 2 R 2 ^{ 2q 2q + 2n 2 R 2 /q J y^T^Wfq J 

From this expression the statistical properties of the entire spectrum of player-symmetric NE may be deduced. In the 
thermodynamic limit, a randomly chosen NE will give the payoff v = u y = v x = argmaxS' K (^) with probability one, 
because the number of NE with this payoff N(v, v) = exp{7Vmax5' K (y)} is exponentially larger than the number of all 
other NE. Similarly, the self-overlap q — ^ xf = JT x ± an d the fraction of strategies played with non-zero probability 
p — Px — Py will take on their saddle-point values of (|9j) evaluated at the maximum of S K (v) with probability one. 



1 The annealed approximation does not differ by more than 3% in the entropy and by 10% in the order parameters from the 
results of the more complicated replica calculation of the quenched average, which will be presented elsewhere. 

2 This assumption ha been verified by explicitly including the calculation of the effective normalizing determinant into the 
disorder average. 
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FIG. 1. The entropy S K of the NE which exponentially dominate the spectrum S K (v)- The analytic results are compared 
with numerical simulations for N — 18 averaged over 100 samples. 

Figure [l] shows the entropy S K at the maximum of S K {v) as a function of k. The numerical results were obtained 
by enumerating all NE for N=18 [^Jl^]. Given that enumeration is only possible for small sample sizes, the agreement 
between the annealed approximation and the numerical results is quite good. 

Figure |^ shows the payoff v and the fraction p of strategies played with non-zero probability a gain as a function of 
k. The numerical results were obtained for N=50 by using the iterated Lemke-Howson algorithm |^||, which locates 
a single but typical NE. The deviation between analytic and numerical results may be accounted for by the use of the 
annealed calculation and finite-size effects. 
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FIG. 2. The payoff v (solid line) and the fraction p of strategies played with non-zero probability (dashed line) . The analytic 
results are compared with numerical simulations for N = 50 averaged over 200 samples. 

For zero-sum games (k = —1) there is only a single NE Jl^] so S K —o = 0, and the payoff is zero due to the symmetry 
between the players. As K is increased the number of NE rises. The maximum of the typical number of NE is reached 
for the case of symmetric games, where S K= i ~ 0.362. This result may be compared with a rigorous upper bound 
derived using geometric methods [j5|,^| , which states that for any non-degenerate N-by-N bimatrix game with large iV 
there are at most e°' 955N equilibrium points. Thus the typical-case scenario investigated here does not saturate this 
bound. 

The increase of v with n may be understood as follows: At increasing k the outcome of a pair of strategies 
which is beneficial to player X say, tends to become more beneficial to player Y. As a result players focus on these 
strategies and the payoff to both players rises. By the same token the fraction p of strategies which are played with 
non-zero probability at a NE decreases with k and the self-overlap of the mixed strategies q increases. 

Even though the properties of the NE at a given k are dominated by the peak of S K (v), there is an important 
qualitative difference between the curves of different values of k. For — 1 < k < k c ~ —.59, S K (v) reaches its 
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maximum and then decreases crossing the S = axis at some point. For re > re c however, S R (y — > oo) — > 0+ and 
p — > 0+. Hence there is an exponential number of NE offering an arbitrarily large (but finite) payoff to either player. 

This observation can be corroborated by considering the extreme case of pure strategy Nash equilibria (PSNE) (i, j) 
where each player only plays a single (pure) strategy i.e. Xi = N5i>i, yj — NSj>j, Vi',f — 1,...,N. According 
to (|J), a PSNE corresponds to a site (i, j) which is simultaneously a maximum of the column a^j and a maximum of 
the corresponding row of biy . The average of the number M. of PSNE is thus given by 



m = e n - °«) n - 6 «) 

— N 2 J dadbp(a,b)H N - 1 (~VNa)H N - 1 (-VNb) (10) 

where p(a,b) denotes one of the Gaussian factors in (||). 

Figure p| sh ows the average number of PSNE as well as its variance as a function of re. For N — ► oo one finds from 
( |io|) that M scales with N 2k ^ 1+k \ so the number of PSNE vanishes for re < 0, at k = there is on average 1 PSNE, 
and for re > the number of PSNE diverges. The increase of the number of PSNE with re may be understood as 
follows. At low re, a and b are anticorrelated, so in the extreme case re = —1 a PSNE corresponds to a site which 
is simultaneously a maximum of the column a^j and a minimum of the row ay = —bif. In the thermodynamic limit, 
such points are exponentially rare. For large re, the condition for a PSNE is much easier to fulfill and in the case 
re = +1, there is always at least one PSNE, namely the maximum entry of Oy = 6^ . 

Note that apart from the point re = 0, the number of PSNE is also self-averaging as the relative sample-to-sample 
fluctuations of M. are found to vanish in the large A-limit. Furthermore for all re > —1 the typical PSNE payoffs to 
leading orders of N are found to be independent of re, they are J N(2lnN — In In N + 0(1)). 
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FIG. 3. Average number M of PSNE for N = 25, 50, 100 (full, dashed, dotted lines). The lines show analytical result @, 
the symbols numerical results, averaged over 1000 samples. Inset: Numerical results for the logarithm of Ai (circles) and of 
A4 2 — M with re = 0.25, 0.5, 0.75 (bottom to top) against In N. The dashed lines show the asymptotic slope 2k/ (1 + re) of M. 



In conclusion we have used methods from the statistical mechanics of disordered systems to describe the typical 
properties of Nash equilibria in bimatrix games with large random payoff matrices. We find that in the thermodynamic 
limit for a randomly chosen Nash equilibrium quantities such as the fraction of strategies played with non-zero 
probability, the self-overlap of the mixed strategies and most importantly the payoff to either player take on a specific 
value with probability 1 . We have analytically calculated these quantities in the annealed approximation as a function 
of the correlation re between the payoff matrices and found good agreement with numerical simulations. Furthermore 
the properties of Nash equilibria in pure strategies were calculated as a limiting case. A number of extensions of this 
work may be considered, including the case where the players have different numbers of strategies at their disposal 
and the generalisation to games of several players. 
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